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Abstract 

In this short expository article, we describe a mathematical tool called 
the probabilistic method, and illustrate its elegance and beauty through 
proving a few well-known results. Particularly, we give an unconventional 
probabilistic proof of a classical theorem concerning the Turan number 
T(n,k,l). 

1 Introduction. 

Probabilistic combinatorics is a field in combinatorics that was pioneered by 
one of the greatest mathematicians of all time, Paul Erdos. It utilizes notions 
from probability theory (mainly discrete probability theory) to prove existence 
results; for instance, it has provided solutions to the following questions. 



For any graph G, can we find a bipartite subgraph which contains at least 
half of the edges of Gl 



• Given a family J- of subsets of [n] = {1, 2, if we know its size, can 

we find two elements A,BgJ~ such that A C Bl 

As a matter of fact, we will provide an answer to the second question above. 

Mathematicians describe the tools of probabilistic combinatorics using the 
generic term "the probabilistic method." There is a wide range of techniques 
in probabilistic combinatorics, and the interested reader should consult [2]. Its 
diversity notwithstanding, we shall introduce the fundamental ideas of the prob- 
abilistic method; perhaps the most straightforward and unambiguous way to do 
so is to solve a problem. The following theorem, due to Mantel (1907) [4], tells 
us that in any graph, if it contains many edges, then it contains a triangle. Note 
that for illustrative purposes, we shall also present a non-probabilistic proof of 
the theorem. 

Theorem 1.1 (Mantel's Theorem) In any graph G(V,E) with \V\ — n, if 
G does not contain any triangle, then 

, n 2 
E < — . 
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Proof (non-probabilistic). Note: In this proof, we use A(G) to denote the 
maximum degree among the vertices ofG. 

Label the vertices V2, v n . Among these vertices, pick a vertex with 
degree A(G) (there may be more than one such vertex); without loss of gener- 
ality, let this vertex be v p (for 1 < p < n). If deg(v p ) — k, then let the set of k 
neighbours of v p be P. If there exists a pair of vertices that are connected by 
an edge in P, then we are done. If not, each vertex in P has degree at most 
n — 1 — (k — 1) = n — k. Moreover, the n — k — 1 vertices in V \ P each have 
degree at most k. Therefore, we have 



2\E\ = deg(v 4 ) < k + (n - k)k + (n - k - l)k = (2n - 2k)k. 

i=l 

This gives us \E\ = (n - k)k < ( ^ n - k )+ k ^ 2 = nL^ w here the inequality 

follows from the Arithmetic- Geometric Mean inequality. Since \E\ G N, we 
have 

\E\ < 

as desired. ■ 

Proof (probabilistic). Note: In this proof, a clique on k vertices is simply a 
set of k vertices in which every two vertices in it are connected by an edge. 

Consider a probability distribution p\, ...,p n on the vertex set V of G. Let us 
pick two vertices u, v independently and at random (allowing repetition). Then 

P({u, v}eE)= PiPi- 

l,j:{l,j}eE 

We first assume that the probability distribution is uniform, i.e., p\ = p 2 = 
• • • = p„ = I. This gives us 

F({u,v}eE) = ^. 



Now, we modify the distribution to make P({u, v} G E) as large as possible. 
This happens when the probability distribution that maximizes the probability 
is uniform on some maximal clique. Indeed, suppose that there are two non- 
adjacent vertices i,j such that Pi,pj > 0; let S{ = X^fe-{i k}eE P k an d s i = 
Sfc-{j k}eE P k - ^ Si — s j ( res P- s i < s j)i we se t the probability of vertex i 
to pi + pj and the probability of vertex j to zero (and conversely if Sj < Sj) . 
This increases F({u,v} G E) by Sifa + Pj) — (PiSi + PjSj) = Pj(st — Sj) (resp. 
Pi(sj - s^). 



2 



Since the process is finite, we eventually reach a situation where there are no 
two non-adjacent vertices of positive probability (piPj — as long as {i, j} E), 
i.e., the probability distribution is on a clique Q. Then, 



P({u,v} G E) = ¥(u ^ v) = 1 



Erf- 



Assume there is no clique larger than 2 in size (if not, then there would 
be a triangle violating our assumption). By Jensen's Inequality on the convex 
function f{z) = z 2 , the above expression is maximized when pi is uniform on 
Q, i.e., pi = , which implies 



{u,v} e E) < l 



< 1 



1 

2' 



assuming that there is no clique larger than 2 in size (if not there would be a 
triangle, violating our assumption). Since we started with P({u,w} G E) = 2-£i 
and never decreased it in the process, we yield 



' ' - T 



\E\ < 



n 

T 



as desired. 



In fact, Theorem 1 1 . 1 1 can be generalized to the same problem for cliques of 
size k, where k > 3 (for that matter, Mantel established the result for k = 2). 
The general problem was solved by the Hungarian mathematician Paul Turan, 
and a wonderful probabilistic proof of it can be found in [I] . 



2 Further Examples. 

Recall the second question in the Introduction regarding the size of a family of 
sets of [n]. Intuitively, if the size of the family, |.F|, is large enough, then we 
should be able to hnd some A, B G T such that A C B. Indeed, the following 
theorem of Sperner gives us a sufficient condition on J- such that we can hnd 
two sets such that one contains the other. 

Theorem 2.1 (Sperner's Theorem) If J 7 is a Sperner family of subsets (or 
an antichain) of [n] (i.e., for all A, B G T we have A <f_ B and B <f_ A), then 




Proof. Without loss of generality, let T = {A\,A2, ...,A\jrt}. Consider any 
Ai G T with \Ai\ = aj. Randomly and uniformly pick any permutation of [n]. 
Denoting by Ei the event that the elements in Ai appear in the first ai numbers 
in the permutation, we get 
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(aOKn-oQl 1 1 

Moreover, the Ei are mutually exclusive because E^ (~l £y is the event that 
both Ai and Ay appear at first in the permutation, implying that one of Ai C Aj 
or Aj C Ai holds. This contradicts our hypothesis. Therefore, summing up our 
probabilities and noting that probabilities are always less than 1, we obtain 



\i=l ) i=l Va 4 / 



' n \ ' 



where the last inequality follows from the fact that Q) is maximized when 
p — If J . The result follows. ■ 

Next, we have a classical result that is related to the Turan number T(n,k,l). 
We shall present a probabilistic proof which differs from the conventional non- 
probabilistic proof (see for instance [3J chapter 1]) of the theorem. 

Theorem 2.2 The Turan number T(n,k,l), where n > k > I, is the smallest 
number of l-element subsets of an n-element set X such that every k-element 
subset of X contains at least one of these sets. If T{n, k,l) — \J-\, where J- is 
the smallest family satisfying the above conditions, then 

' '-(?) 

Proof. Let us randomly select without replacement a fc-element subset K from 
X. Let A be the event that the first I elements picked form a set L which 
belongs to T . Then 



P{A) 



\T\ 



I - 1 
n - 1 



n — I + 1 n — I 



-k+1 



(?)■ 



On the other hand, let us consider the events Ei where Ei is the event that 
the ith fc-element subset was selected. Then we have 



(t) 



CO 



¥(A) 



i i 



since each fc-element subset of X contains at least one element in T . Combining 
the two equations yields the desired inequality. ■ 
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3 Conclusion 



Truly, the probabilistic method presents a pristine and elegant approach to 
problem solving. From its very first applications in extremal graph theory by 
Paul Erdos, to its plethora of applications (see [HI3]) m coding theory, number 
theory and geometry, probabilistic combinatorics today has certainly emerged 
as one of the richest subdisciplines of mathematics where beauty, creativity and 
rigour converge. In Erdos' own words, "This one's from The Book!" 
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